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Sir: 

BRIEF ON APPEAL FOR APPLICANTS 

I. INTRODUCTION 

This appeal is taken from the Examiner's final rejection of 
claims 1 through 18 in the Office Action dated January 28, 2003 and 
the Examiner's Advisory Action dated April 4, 2003, copies of which 
are attached hereto as Exhibits A and B, respectively. 

Each claim on appeal has been finally rejected under 35 U.S.C. 
§103 (a) as purportedly obvious over U.S. Patent No. 6,047,257 to 
Dewaele. Obviousness or nonobviousness of the claimed invention is 
the only issue presented by this appieal . 

As set forth in more detail below, the claims on appeal are 
directed to hands -free command and control of a dental imaging 
system wherein retrieval, display and manipulation of digital 
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images is voice-controlled. 

In contrast, the Dewaele reference relates to speech 
processing for entering identification data for a medical image to 
be identified. The claimed invention is neither shown nor 
discussed in' the Dewaele reference or other prior art of record. 

Applicants' brief on appeal is- due June 25, 2003. 
Accordingly, this Appeal Brief is being timely filed. 

II. REAL PARTY IN INTEREST 

The real party in interest is AFP Imaging Corp. by virtue of 
an Assignment executed by David Vozick and James Johnson on August 
16, 2001. This Assignment was recorded on January 22, 2002 at Reel 
12542, Frame 0501. 

III. RELATED APPEALS AND INTERFERENCES 

There are no other appeals or interferences known to 
Applicants, Applicants' legal representative, or assignee which 
will directly affect or be directly affected by or have a bearing 
on the Board's decision in the pending appeal. 



IV. STATUS OF THE CLAIMS 

Claims 1 through 18 are pending. Claims 1, 15 and 17 are 
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independent claims . 

The application was filed with claims 1 through 18 on August 
8, 2001. In a September 6, 2002 Office Action, the Examiner 
rejected claims 1-18 under 35 U.S.C. §103(a). Applicants filed a 
response to the September 6, 2002 Office Action, without amending 
the claims. On January 28, 2003, the Examiner finally rejected 
claims 1-18 under 35 U.S.C. §103(a). On March 28, 2003, Applicants - 
responded to the January 28, 2 003 final rejection, without amending 
the claims. In an April 4, 2003 Advisory Action, the Examiner 
indicated that Applicants' March 28, 2003 response to the January 
28, 2003 final Office Action was considered but was not deemed to 
place the rejected claims in condition for allowance. 

Accordingly, claims 1 through 18 define the subject matter of 
this appeal. These claims are as follows: 

1. An apparatus for hands-free command and control of a 
dental imaging system having a display monitor, a microphone and a 
storage device storing a plurality of dental images corresponding 
to a selected dental patient, comprising: 

a speech recognition unit which converts to electronic speech 
data a voice command received through the microphone to select one 
of the plurality of dental images for viewing; and 

a command and control processor for the electronic speech data 
received from said speech recognition unit, wherein said command 
and control processor causes the selected dental image to be 
retrieved from the storage device and then displayed on the display 
monitor . 

2. The apparatus of claim 1, wherein thumbnail 
representations of the plurality of dental images corresponding to 
the selected dental patient are displayed for selection by the 
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user . 

3. The apparatus of claim 1, wherein the plurality of dental 
images include intra-oral images, panoramic dental images, FOTI 
images and periodontic images. 

4. The apparatus of claim 1, wherein text, audio and video 
data are also stored in the storage device and available for 
selection to be displayed. 

5. The apparatus of claim 1, wherein the dental images are 
acquired from one of a dental computer connected device, video 
camera, digital scanner or X-ray storage device and stored in the. 
storage device. 

6. The apparatus of claim 1, wherein the storage device is 
connected to a computer network. 

7. The apparatus of claim 1, wherein the storage device is 
remotely located and connected through a network. 

8. The apparatus of claim 1, wherein the command and control 
processor is remotely located and connected through a network. 

9. The apparatus of claim 1, wherein the microphone is 
wireless . 

10. The apparatus of claim 1, wherein after the selected 
dental image is retrieved from the storage device and displayed on 
the display monitor, the command and control processor, in response 
to a second voice command received through the microphone and 
converted by said speech recognition unit, causes the selected 
dental image to be further processed according to the second voice 
command . 

11. The apparatus of claim 1, wherein after the selected 
dental image is retrieved from the storage device and displayed on 
the display monitor, the command and control processor causes a 
voice interface through a speaker to provide a set of options, for 
selection by a user, for further processing the selected dental 
image . 

12. The apparatus of claim 1, wherein the command and control 
processor causes a voice interface through a speaker to provide a 
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voice prompt to guide a user through selection of an appropriate 
dental image. 

13. The apparatus of claim 1, wherein the speech recognition 
unit includes a hardware module electronically coupled to the 
command and control processor. 

14. The apparatus of claim 1, wherein the speech recognition 
unit comprises a client-server speech recognition system. 

15. A dental imaging system, comprising: 
a microphone; 

a display monitor; 

a storage device, wherein the storage device stores a 
plurality of dental images corresponding to a selected dental 
patient; and 

a speech recognition command unit which converts to. electronic 
speech data a voice command received through said microphone to 
select one of the plurality of dental images for viewing, and 
processes the electronic speech data to cause the selected dental 
image to be retrieved from said storage device and then displayed 
on said display monitor. 

16. The system of claim 15, wherein the microphone is 
wireless . 

17. A method of hands-free command and. control of a dental 
imaging system, comprising the steps of: 

converting to electronic speech data a voice command from a 
user through a microphone to select for viewing one of a plurality 
of dental images stored in a storage device for a selected dental 
patient; and 

processing the electronic speech data to cause the selected 
dental image to be retrieved from the storage device and then 
displayed on a display monitor. 

18. The method of claim 17, wherein the microphone is 
wireless . 

A copy of claims 1 through 18 is attached hereto as Exhibit C. 
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V. STATUS OF AMENDMENTS 

Applicants have not submitted any claim amendments subsequent 
to the final rejection of the application. Claims 1-18, set forth 
above and* in Exhibit C, are the claims on appeal. 

VI . SUMMARY OF THE INVENTION 

Applicants' invention provides tools (in the form of apparatus 
and method) for hands -free command and control of a dental imaging 
system to select dental images to be retrieved from a storage 
device, displayed on a display monitor and manipulated. A voice 
command from a user is converted through a microphone to electronic 
speech data for selecting for viewing one of a plurality of dental 
images which are stored in a storage device for a selected dental 
patient. The electronic speech data is processed to cause the 
selected dental image to be retrieved from the storage device and 
then displayed on a display monitor. 

VII. ISSUE PRESENTED 

Whether the Examiner has presented a prima facie case that 
claims 1-18 are unpatentable under 35 U.S.C. § 103(a) over U.S. 
Patent No. 6,047,257 to Dewaele. 
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VIII. GROUPING OF CLAIMS 

For the purpose of this appeal, independent claims 1, 15 and 

17 stand or fall together. The dependent claims are believed to be 
nonobvious for at t least the very same reasons that independent 
claims 1, 15 and 17 are believed to be nonobvious. However, the 
dependent claims also recite additional features which are believed 
to be nonobvious . 

IX. ARGUMENTS 

A. U.S. Patent No. 6,047,257 to Dewaele fails to render 
obvious the invention set forth in claims 1-18. 

1. The Examiner's Position 

In the Final Office Action dated January 28, 2003 the Examiner 
maintained the rejection of claims 1-18 under 35 U.S.C. §103 (a) as 
allegedly unpatentable over U.S. Patent No. 6,047,257 to Dewaele 
("the Dewaele reference") . A copy of the Dewaele reference is 
attached hereto as Exhibit D. 

The Examiner stated that the Dewaele reference teaches an 
apparatus for hands -free command and control of a dental imaging 
system having display monitor, a microphone and storage device 
storing a plurality of dental images corresponding to a selected 
dental patient, comprising a speech recognition unit which converts 
to electronic speech data a voice command received through the 
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microphone to select one of the plurality of images for viewing; 
and a command and control processor for the electronic speech data 
received from the speech recognition unit, wherein the command and 
control processor .causes the selected image to be retrieved and 
displayed on the display monitor. 

The Examiner acknowledged that the Dewaele reference, while 
discussing displaying medical images in the field of radiology, so - 
that an attending physician can make his or her diagnosis and 
dictations transcribed and recognized by a speech recognition unit, 
does not specifically relate to the field of dentistry or to the 
hands-free display and manipulation of dental images. The Examiner 
alleged that it would have been obvious to. one with ordinary skill 
in the art at the time of the invention to modify the method and 
apparatus as taught by the Dewaele reference in the medical field 
to the field of dentistry because, according to the Examiner, one 
would readily realize that by using speech recognition with respect 
to a plurality of medical images, a hands-free environment is 
provided to display and manipulate those images. 

It appears to be the position of the Examiner that the Dewaele 
reference discloses all of the features of the claimed invention, 
except that the Dewaele reference relates to medical imaging and, 
as acknowledged by the Examiner in the January 28, 2003 Office 
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Action, does not disclose that its disclosure can be adapted to 
dental imaging. 

Applicants contend that the Examiner summarily concluded that 
the claimed invention would have been obvious, without addressing 
pertinent differences between the claimed invention and the Dewaele 
reference . 

2. Applicants' Position 

Applicants contend that the Dewaele reference does not 
establish a prima facie case of obviousness because, as discussed 
below, (i) the Examiner did not properly consider pertinent 
differences between the Dewaele reference and the claimed 
invention, and (ii) the Examiner has not shown a teaching or 
suggestion in the prior art or a motivation otherwise to modify the 
teachings of the Dewaele reference in a manner that would render 
the claimed invention obvious. Since the Examiner has not made a 
prima facie case of obviousness, the rejection of claims 1-18 
should be reversed, in accordance with applicable case law. See , 
e.g. , In re Fritch , 972 F.2d 1260, 1265, 23 U.S.P.Q.2d 1780 (Fed. 
Cir. 1992) (mere fact that prior art can be modified in manner 
suggested by the Examiner does not make the modification obvious 
unless the prior art suggested the desirability of the 
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modification) . 

a. The Examiner did not properly consider 
pertinent differences between the prior art 
• and the claimed invention 

It is submitted that the Examiner did not properly analyze the 
differences between the Dewaele reference and the claimed invention 
since the Examiner apparently did not give proper consideration to 
(1) the problem confronting the inventors to which the claimed 
invention is directed, and (2) other pertinent differences between 
the Dewaele reference and the claimed invention. 

The problem confronting the inventors is relevant to the scope 
of the prior art and whether a cited reference is pertinent to the 
claimed invention. See Monarch Knitting Machinery C orp. v. Sulzer 
Morat GmbH , 139 F.3d 877, 881-882, 45 U.S.P.Q.2d 1977 (Fed. Cir. 
1998) ; Heidelberger Druckmaschinen AG v. Hantscho Commercial 
Products, Inc. , 21 F.3d 1068, 1072, 30 U.S.P.Q.2d 1377 (Fed. Cir. 
1994) . In addition, a finding of obviousness based on improper 
analysis of the differences between the prior art and the claimed 
invention cannot stand and should be reversed. See Smiths 
Industries Medical Systems, Inc. v. Vital Signs, Inc. , 183 F.3d 
1347, 1355, 51 U.S.P.Q.2d 1415 (Fed. Cir. 1999) (obviousness finding 
required reversal since finding was based on improper analysis of 
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distinguishing feature which could not be found in, nor was 
rendered obvious .by /: the prior art) . 

The Examiner did not give due consideration to the problem 
confronting the inventors to which the claimed invention is 
directed, as compared to what is disclosed and taught in the 
Dewaele reference. 

The problem confronted by the inventors here is the risk of 
infection to a dental patient caused by manual operation of 
computer input devices of a dental imaging system while attending 
to the patient. 

The object of the present invention is hands -free command and 
control of dental images in a dental imaging system. In many 
instances, a dentist (or another dental care professional) needs to 
refer to one or more of a patient's plural dental images, while 
attending to the patient. The dental images may include, for 
example, intra-oral images, panoramic dental images, FOTI images 
and periodontic images. In some dental office practice, these 
images are stored electronically in a storage device of a dental 
imaging computer system. Conventional dental imaging systems 
typically require manual operation of computer input devices in 
order to specify and cause the specified dental image to be 
retrieved and displayed. 
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As discussed in the application at, for example, page 5, lines 
19-27, computerized voice recognition in the methods and 
apparatuses of this application is provided to enable a dentist or 
dental technician r to specify, through spoken commands, dental 
images to be retrieved from storage in a computer system and 
displayed and/or manipulated, without requiring the 
dentist/technician to manually operate computer input devices. 

In addition, as discussed at the application at page 4, lines 
9-26, a user's voice commands can be processed through a .voice 
interface for user selection of options for image processing of the 
retrieved image. For example, a user can command the system to 
manipulate the image, such as rotate, resize (e.g., increase image 
size to full screen, increase or decrease image size by a specified 
percentage, etc.) or move the image on the display, pan or zoom the 
image, change the brightness, contrast, color preferences or other 
color processing settings of the image, select a region of the 
image for manipulation, etc. Thus, the dental professional can 
cause the desired dental images to be retrieved, displayed and 
manipulated, while continuing to use her/his hands for attending to 
the patient, without risking contamination from manually operating 
computer input devices. 

In contrast, the Dewaele reference does not even purport to be 
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directed at reducing risk of infection. Instead, the Dewaele 
reference is concerned with speed and accuracy of entry of 
identification data which is to be associated with a medical image. 
See, for example, the Dewaele reference, column 1, line 1 through 
column 3, line 22. As discussed in the Dewaele reference, column 
2, lines 2 8-33, the setting under which the subject matter of the 
Dewaele reference is performed is that a radiologist or operator 
.will perform a radiographic exposure of a phosphor screen in a 
cassette and transport the cassette to an identification station, 
where the identification data of the patient are entered into an 
identification software program running on the identification 
station. Since one of ordinary skill in the art is told by the 
Dewaele reference that the cassette is transported to another 
location (and thus the identification data entry process does not 
occur during a medical procedure) , the skilled, person would not 
understand the Dewaele reference as relating to control of the risk 
of infection. 

In addition, Applicants contend that the Dewaele reference 
does not disclose or suggest the invention claimed in the present 
application, for at least the following reasons. 

For example, independent claim 17 of the present application 
relates to a method of hands- free command and control of a dental 
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imaging system. The method includes converting to electronic 
speech data a voice command from a user, to select for viewing one 
of a plurality of dental images stored in a storage device for a 
selected dental patient, and processing the electronic command data 
to cause the selected dental image to be retrieved from the storage 
device and then displayed on a display monitor. Similar features 
are recited in independent claims 1 and 15. The claimed invention 
recited in dependent claims provide in addition for further 
processing and image manipulation of the retrieved image. 

In contrast, the Dewaele reference, as understood by 
Applicants, relates to providing identification information to be 
associated with an image on a photo-stimulable phosphor screen. 

The terms "identification inf ormation" and "identification 
data" are defined in the Dewaele reference to be data identifying a 
patient to which a medical image pertains, data identifying the 
examination type that is performed or is going to be performed, and 
other data that are commonly associated with a medical image, such 
as the name of the radiologist, the sex of the patient, etc. (see 
the Dewaele reference, col. 1, lines 14-21, and col. 7, lines 23- 
26) . Conventional systems for entry of identification information 
through use of speech recognition, such as proposed by the Dewaele 
reference, are disclosed at page 2, lines 3-10 of the application. 
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The Dewaele reference discloses that identification 
information may be entered through a microphone and speech 
recognition to expedite the data entry process while reducing the 
probability of data entry error. The Dewaele reference does not 
disclose or suggest, however, use of speech recognition and voice 
command processing in the selection, retrieval for display and 
image manipulation of a computer- stored dental (or medical) image. 

The January 28, 2003 Office Action cites the Dewaele 
reference, column 9, line 3 9 through column 10, line 65, as alleged 
support that the Dewaele reference discloses a command and control 
processor for causing a selected image to be retrieved and 
displayed on a monitor. The Dewaele reference, Fig. 1, elements 4 
and 6-8, column 5, line 1 through column 6, line 6, column 7, lines 
19-55, column 9, line 39 through column 10, line 65, are cited in 
the Office Action as alleged support that the Dewaele reference 
teaches processing a voice command received through a microphone to 
select one of the plurality of images for viewing, and discloses 
manipulation of images corresponding to a dental patient, through 
voice recognition of voice commands. Applicants respectfully 
disagree . 

Figure 1 of the Dewaele reference shows a speech 
recognition/synthesis subassembly 4, an antenna 6, a cassette 7 on 
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which a radiographic image is to be recorded, and a radiof requency 
tag 8 on which identification data can be stored. None of the 
elements are shown or disclosed as being associated with processing 
voice commands to select images for viewing and image manipulation. 

The Dewaele reference discusses at col. 5, lines 1-65, the 
advantages of using automatic voice recognition for receiving and 
processing identification information. The. Dewaele reference, col. 
4, line 65 through col. 6, line 9, is repeated below for the record 
for purposes of completeness : 

A strong prejudice has existed against the application of data 
input via speech for identification purposes. Speech recognition is 
difficult primarily because of variability, which comes in different 
forms: (1) variability of sounds (different words, phrases or 
subword units) , (2) transducer/channel variability. Further there is 
a risk of interference with background noise from extraneous speech 
or transient acoustic events. 

In the field of medical images these prejudices have been 
overcome because : 

(1) the number of words in a medical identification task is 

restricted to a vocabulary of at most 100 single and 
isolated words so that the variability of sounds is 
limited. 

(2) transducer/channel variability including differences in 

signal characterisation is limited since the input is 
always via microphone, the characteristics of which are 
known at design stage. Thus, the voice recognition 
system need not be able to cope with a variety of 
sources . 

(3) the risk of interference with background noise from 

extraneous speech or transient acoustic events is 
limited on a radiology department since the voice input 
is under software control of the application and is 
restricted to well defined time slots in the course of 
operation. 

Significant advances in several technologies and application 
areas pertinent to voice processing have made feasible automatic 
voice recognition, such as (1) smart microphones adapting to any 
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acoustic environment and giving optimum signal-to-noise ratio in 
noisy backgrounds (2) acoustic echo cancellation to provide echo- 
free communications (3) advances in algorithms and DSP 
implementation of these algorithms providing high performance on 
reasonable cost platform. Although the sources of variability cannot 
be eliminated in general, speech recognition technology has reached 
a point to model and handle them properly. These models are based on 
(1) standard pattern recognition or (2) on hidden Markov models. 

The first class computes a best match similarity score between 
a spectral pattern of features against a database of stored 
vocabulary patterns. These spectral patterns model differences 
across different speakers and variance statistics derived over the 
time duration of the word. The second class of models calculates the 
highest likelihood score for a probabilistic model for each word of. 
a vocabulary of words . 

Voice processing has proven to. be very well suited for the 
purpose of identification in a hospital environment or specifically 
in a radiology department for the following reasons. 

First, the speaking format, that is the mode of speaking to 
the machine has limited complexity : it will basically fall into one 
of the following categories: 

(a) isolated word recognition (each spoken command or data 

entity represents one single word) or 

(b) connected word mode (the operator uses fluent speech but 

with highly constrained vocabulary) or 

(c) continuous speech mode (the operator dictates phrases or 

performs a dialogue) . 

The first mode is suited for control and command entry and for 
input of single word data, the second mode is suited for entry of 
letters of the alphabet or digits. The third category of speaking 
format is continuous speech and is applicable for voice entry of 
comment-like annotations or clinical protocols to a patient's 
identification records . 

A second reason why voice processing is well suited for 
identification of medical images is that the degree of speaker 
dependence is low, since the number of operators is typically low 
and almost fixed over time. 

A third reason is that the vocabulary size, and complexity is 
low to moderate. It will typically consist of a set of command and 
control words to navigate the user interface of the identification 
application by appropriate words for operations such as screen 
selection, cursor movement and key stroke shortcuts. Further, it 
will consist of sets of words for mandatory inputs such as 
examination type, sub -examination type, image destination type. 
Finally, many identification data are letters drawn from the 
alphabet, or digits such as patient's birthday (digits), patient's 
sex (letter), patient's index (digits), number of hardcopies 
requested (digit), image layout parameters (letters or digit) . 



David Vozick and James Johnson 
Serial No. : 09/924, 831 
Filed: August 8, 2001 
Page 18 



(Emphasis added) . 



Thus, the Dewaele reference teaches that speech processing is 
suitable for entering identification data for a medical image to be 
identified. The Dewaele reference simply does not disclose or 
suggest, however, converting a voice command received through a 
microphone to electronic speech data for selecting one of a 
plurality of images for viewing and image manipulation . 

Although the Dewaele reference refers to command and control, 
it is clear that the reference to command and control corresponds 
to navigation of the user interface for the identification 
function. Thus, instead of using a mouse or another pointing 
device, a user can orally specify the desired screen. 

Col. 7, lines 19-55 of the Dewaele reference, which is also 
cited in the Office Action, states as follows: 

The described system is a digital radiography system wherein a 
radiographic image is recorded on a photostimulable phosphor screen. 
The photostimulable phosphor screen is conveyed in a cassette 7. The 
cassette is provided with a radio- frequency tag 8 in which 
identification data, i.e. data concerning a patient that is 
subjected to a radiographic examination and concerning the type of 
examination that is performed etc., are stored. 

The system comprises an identification station 1, a read out 
station 2 in which the image stored in the photostimulable phosphor 
screen is read out and digitized and wherein the digital signal 
representation of the radiographic image is subjected to image 
processing. A laser recorder 3 is provided for reproducing the read 
out image . 

The system shown in FIG. 1 can be expanded to include other 
stations such as a workstation for performing off-line processing on 
the digital representation of the radiographic image and/or for 
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performing soft copy diagnosis. However, since these additional 
components are not relevant in the context of the present invention , 

they will not be described in detail. 

The identification station 1 consists of a personal computer 
(or alternatively a workstation) which is in the described 
embodiment connected to a network so as to provide access to a 
hospital information system (HIS) or a radiology information system 
9 (RIS) . 

The identification station is further equipped with a speech 
recognition/synthesis subassembly 4, with a dynamic microphone input 
5 to provide data input via speech and a speaker 10 to provide 
auditive responses. An example of a suitable speech recognition 
subassembly is a standalone board Star 21 of Lernout and Hauspie 
(Belgium) with microphone speech input and, an (AD21) DSP, speech, 
models stored in (AMD Flash) memory, RS232 connection to host, 
amplifier for synthesized TTS (Text to Speech), speech output, power 
supply. (Emphasis added) 



Thus, the Dewaele reference describes use of a photostimulable 
phosphor screen in a digital radiography system, wherein the screen 
is conveyed in a cassette, and the cassette is provided with a 
radio -frequency tag in which identification data is stored. The 
screen' is carried to a read out station where the image on the 
screen is read out, digitized and processed. The Dewaele reference 
also states that additional processing facilities may be provided. 
However, the Dewaele reference does not disclose or suggest that 
the additional processing facilities should have voice command and 
control, or say anything about them. Indeed, the Dewaele reference 
emphasizes that the "additional components are not relevant in the 
context of the present invention." 

Applicants have carefully studied the cited portions of the 
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Dewaele reference, and found no disclosure or suggestion of 
converting a user voice command to electronic speech data for 
selecting for viewing and image manipulation dental images stored 
in a storage device, as provided by the claimed invention. 

The Dewaele reference discusses, at column 9, line 3 9 through 
column 10, line 65, operations performed at the identification 
station, including entry of assorted identification information, 
such as patient's name, examination type, sub -examination type, 
comments, etc. However, the Dewaele reference simply does not 
disclose or suggest (a) processing electronic voice command data to 
cause a selected image to be retrieved from an electronic storage 
device and displayed on a monitor, and (b) manipulating the 
retrieved image according to voice commands processed through voice 
recognition. 

Col. 9, line 3 9 through col. 11, line 16 of the Dewaele 
reference is repeated below for the record for purposes of 
completeness : 

The following is a description of operations performed, along 
with details pertinent to the voice recognition functionality: 

A radiologist specific identification-screen is popped up 
either by sensing an operator's personal identification carrier to 
the read/write identification subsystem or by voice recognition of 
an utterance of the operator's name by the speech recognition 
subassembly. The database of voice patterns pertaining to the 
operator is made active. 

The patient's name is uttered by the operator to identify the 
patient to the system. On correct recognition, the name is displayed 
in the patient name field. On false recognition, an alternative 
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voice input is offered consisting of spelling the patient's name. 
During utterance of the letters of the name, the list of patients 
currently residing in the hospital as established during patient 
intake, is popped up onto the screen. The portion of the list 
displayed during spelling is continuously narrowed as more 
successive letters are recognized by the system. In addition to the 
patient name, the list also shows the running number of the patient 
in the list and' the patient's birthday. At all times during spelling 
the name, a shortening may be obtained by uttering the digits of the 
running number of the patient as soon as the data searched for 
become displayed. Both spelling of 2 6 letters of the alphabet and 
the 10 digits is far less prone to recognition error than direct 
recognition of the patients name, for reasons that the vocabulary 
of letters and digits has fixed size and can be specifically trained 
to the operator. In contrast, direct recognition of the patient's 
name is more difficult since the number of words is substantially 
large (as large as 500 e.g.) and since the voice sample of the name 
used as a reference template, has been recorded by a receptionist at 
patient intake. This person in general is different from the 
radiology operator, and patient name recognition thus has presented 
itself as a speaker independent recognition task. An acceptance 
qualifier completes the patient entry; a correction qualifiers 
offers the operator the opportunity to re-enter a name; a rub-out 
qualifier enables to erase letters in much the same way as the 
backspace key on a keyboard operates. As a fallback way of entry, 
the patient name may still be selected by cursor movement from the 
patient list or entered manually by keyboard on network failure or 
absence of a RIS database. The patient name is filled in in. its 
appropriate field, and other patient related data are retrieved from 
the RIS database to complete fields such as sex (M/F) and birthday. „ 
Should these latter items be unavailable, voice entry of them is 
task of recognition of a sequence of letters and digits. 

The system prompts the operator to input the examination type. 
The examination type is one out of a radiologist specific list of 
examination (such as thorax, pelvis, skull, . . . ) and recognition 
thus belongs to the isolated word mode. The size of the examination 
list typically does not exceed 20. On correct recognition, the 
examination type is automatically entered into the appropriate 
field. On false recognition, a list of all examination types and a 
ranking number is popped up to assist the operator in selecting the 
examination type. Selection now is done by uttering the digits (one 
or two digits) of the ranking number. Alternatively, the user may 
use cursor movements to scroll through the list and the "enter" 
button to select. 

The system then prompts the operator to input the sub- 
examination type. The sub-examination type is one out of a 
radiologist specific list of sub -examinations (e.g. "lateral", 
"frontal", . ), pertaining to the examination type just 
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selected. The size of the sub -examination list typically does not 
exceed 25 per examination, still amounting to a total number of sub- 
examinations as large as 500. However, knowledge of the examination 
type restricts the number of valid choices for the sub -examination 
in that sub -examination of other examination classes are not taken 
into consideration. This makes the recognition of the sub- 
examination more manageable. Analogously, on correct recognition, 
the sub -examination type is automatically entered into its field. On 
false recognition, a list of all examination types and a ranking 
number is popped up to assist the operator in selecting the sub- 
examination type by utterance of the corresponding digit sequence. 

Examination and sub -examination determine layout parameters as 
to how the image will be processed, printed and displayed (these 
include patient position, cassette position and exposure class) . 
These parameters are retrieved from radiologist specific internal 
data buffers and are automatically filled out in their appropriate 
fields. Should these fields be modified, the operator issues voice 
commands as to the placement of the cursor in one of these fields 
and modifies the default entry. 

The system prompts the operator to input the destination type. 
The destination type is one out of a radiologist specific list of 
preferred hardcopy and softcopy devices to send the digitized image 
to. The list typically contains smaller than 10 items. Selection 
proceeds in a way similar to that of the examination and sub- 
examination entry. Next, the number of copies on a hardcopy unit is 
entered by voice. ^ 

Optionally, the operator may enter comments in the "user info 
field as a recorded voice stream upon issuing the request "info". 
Voice data is stored along with other identification data in a 

database . - ■ - 

On completion of all fields on the identification screen, the 
system prompts the operator to write the data to the cassette 
identification carrier by means of the Read/Write subassembly on 
recognition of the action word "write" or other meaningful answers 
such as "OK" or "Yes" . 



Applicants have carefully reviewed the cited portions of 
Dewaele . Although Dewaele discloses entering orally-specified 
identification information which may be associated with a medical 
image, Applicants find no disclosure or suggestion in Dewaele of 
processing electronic speech data corresponding to a voice command 
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selecting a dental image, to cause the select ed image to be 
retrieved from the storage device and then displayed on a display 
monitor, as provided by the claimed invention. In addition, 
Applicants find no 'teaching or suggestion in the Dewaele reference 
of manipulating the retrieved image according to voice commands 
processed through voice recognition, as provided by systems, 
apparatus and methods of the present application. 

Since the Dewaele reference does not relate to voice- 
controlled image retrieval, display and manipulation, the Dewaele 
reference cannot render the claimed invention obvious. 

As the case law long has established, differences between the 
subject matter sought to be patented and the prior art may not be 
dismissed as being obvious or not patentably significant without 
some basis in scientific principle or objective support. See 
Custom Accessories, Inc. v. Jeffrey-Allan Industries, Inc. , 807 
F.2d 955, 961-962, 1 U.S.P.Q.2d 1196 (Fed. Cir. 1986); In re Soli, 
317 F.2d 941 137 U.S.P.Q. 797, 801 (C.C.P.A. 1963). Here, the 
Examiner ignored a difference between the prior art and the claimed 
invention. The difference in this instance substantially 
distinguishes the claimed invention from the Dewaele reference, and 
therefore is material to the nonobviousness of the claimed 
invention. The Examiner's dismissal of the difference is a 
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material error and, therefore, the rejection of the claims should 
not stand. 

Accordingly, the Examiner has failed to establish a prima 
facie case of obviousness because the Examiner has not considered 
and addressed pertinent differences between the prior art and the 
claimed invention . 

b . The Examiner has not shown a teaching or 
suggestion in the prior art or a motivation 
otherwise to modify the Dewaele reference in a 
manner that renders the claimed invention 
obvious 

It is submitted that the Examiner has not identified the 
required teaching or suggestion in the prior art or motivation 
otherwise that would lead one of ordinary skill in the art to 
modify the teachings of the Dewaele reference, which relates to 
entry of identification data for a medical image, so as to render 
the claimed invention obvious. 

The Examiner suggests that the claimed invention is 
unpatentable because the only difference between the Dewaele 
reference and the claimed invention is that the Dewaele reference 
relates to identification of medical images and the claimed 
invention relates to dental imaging. 

As discussed above, there are several pertinent differences 
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between the prior art and the claimed invention. 

As the Federal. Circuit recently reiterated in In re Thrift , 
298 F.3d 1357, 1366 (Fed. Cir. 2002) and In re Lee , 277 F.3d 1338 
(Fed. Cir. 2002) , it is an improper basis for concluding that an 
invention would have been obvious if the Examiner does not (1) 
provide proper consideration of the differences between the claimed 
invention and the cited art, and (2) provide objective support, 
such as disclosure or suggestion in the prior art (as opposed to 
conclusory statements or subjective belief) , which would lead one 
skilled in the art to modify the teachings of the cited reference 
in the manner alleged. 

Therefore, the mere fact that the prior art may be modified in 
the manner suggested by the Examiner does not make the modification 
obvious unless there is a teaching or suggestion in the prior art 
itself to make that modification. See In re Rouffet , 149 F.3d 
1350, 1355, 47 U.S. P. Q. 2d 1453 (Fed. Cir. 1998); In re Fritch , 972 
F.2d 1260, 1266, 23 U.S.P.Q.2d 1780 (Fed. Cir. 1992); Ex parte 
Haymond , 41 U.S.P.Q.2d 1217, 1219 (Bd. Pat. App. & Interf . 1996). 
The Examiner has not identified a teaching or suggestion in the 
prior art or a motivation otherwise for modifying the Dewaele 
reference in a manner that would render the claimed invention 
obvious . 
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Thus, as pointed out, since the Examiner did not explain the 
specific understanding or principle within the knowledge of a 
skilled artisan that would motivate one with no knowledge of the 
claimed invention .to make the modification, it must be inferred 
that the modification proposed by the Examiner could only be made 
with improper hindsight. It is well-established by case law that 
hindsight reconstruction in an obviousness analysis, by using the 
claimed invention as an instruction manual or blueprint for 
adapting the teachings of the prior art, is impermissible. See In 
re Rouffet , 149 F.3d 1350, 1357, 47 U.S.P.Q.2d 1453 (Fed. Cir. 
1998); Ex parte Haymond , 41 U. S . P .Q. 2d . 1217 , 1220 (Bd. Pat. App. & 
Interf . 1996) . The Examiner has not shown that persons of ordinary 
skill in the art, confronted with the same problems as the 
inventors and with no knowledge of the claimed invention, would 
modify the elements from the Dewaele reference in the manner 
claimed, as required by applicable case law. See In re Rouffet , 
149 F.3d 1350, 1357, 47 U.S.P.Q.2d 1453 (Fed. Cir. 1998). 

Applicants contend that the Examiner has not established a 
prima facie case of obviousness since the Examiner has not shown a 
teaching or suggestion in the prior art or a motivation otherwise 
to modify the Dewaele reference in a manner that would render the 
claimed invention obvious. 
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X. CONCLUSION 

For the foregoing reasons, Applicants submit that the 
Examiner's rejection of claims 1-18 is erroneous and respectfully 
submit that the rejection of these claims should be reversed. 

The required fee for filing an appeal brief under 37 C.F.R. 
1.17(f) is THREE HUNDRED TWENTY DOLLARS ($320.00). Applicants have 
enclosed a check in the amount of THREE HUNDRED TWENTY DOLLARS 
($320.00) to cover the fee for the filing of this brief on appeal. 
If any additional fee is required, authorization is hereby given to 
charge the amount of any such fee to Deposit Account No. 03-3125. 

Respectfully submitted, 
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Examiner 
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Applicant(s) 
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Art Unit 
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The MAILING DATE of this communication appears on the cover sheet with the correspondence address - 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .1 36(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 1 33). 

- Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment See 37 CFR 1.704(b). 

Status 

1 )E3 Responsive to communication(s) filed on 14 November 2002 . 
2a)Q This action is FINAL. 2b)S This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 
Disposition of Claims 

4) ^ Claim(s) 1-18 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 4fo. 5 1 2^ ' 

5) D Claim(s) is/are allowed. j ' j 

6) K C\a\m(s) h18 is/are rejected.. j FEB 3 2003 j 5^. 

7) D Claim(s) is/are objected to. J ^_ flpp ~7 -2$' 03 - /\p 

8) D Claim(s) are subject to restriction and/or election requirements. ~ ! 

Application Papers 

9) D The specification is objected to by the Examiner. 

10) D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 

1 1) D The proposed drawing correction filed on is: a)D approved b)D disapproved by the Examiner. 

If approved, corrected drawings are required in reply to this Office action. 

12) D The oath or declaration is objected to by the Examiner. 
Priority under 35 U.S.C. §§ 119 and 120 

13) D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 

a)DAII b)Q Some*c)D None of: 

1 .□ Certified copies of the priority documents have been received. 

20 Certified copies of the priority documents have been received in Application No. . 

3.D Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 

14) D Acknowledgment is made of a claim for domestic priority under 35 U.S.C. § 1 19(e) (to a provisional application). 

a) □ The translation of the foreign language provisional application has been received. 

15) D Acknowledgment is made of a claim for domestic priority under 35 U.S.C. §§ 120 and/or 121. 
Attachment(s) 

1) ^ Notice of References Cited (PTO-892) 4) □ Interview Summary (PTCM13) Paper No(s). . 

2) □ Notice of Draftsperson's Patent Drawing Review (PTO-948) 5) □ Notice of Informal Patent Application (PTO-152) 

3) □ Information Disclosure Statement(s) (PTO-1449) Paper No(s) . 6) O Other: 
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DETAILED ACTION 



Claim Rejections - 35 USC § 103 



1 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as 
set forth in section 1 02 of this title, if the differences between the subject matter sought to be 
patented and the prior art are such that the subject matter as a whole would have been obvious 
at the time the invention was made to a person having ordinary skill in the art to which said 
subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

2, Claims 1-18 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Dewaele (6,047,257). 

As per claim 1 , Dewaele teaches an apparatus for hands-free command and 
control of a dental imaging system having a display monitor, a microphone and 
storage device storing a plurality of dental images corresponding to a selected 
dental patient, comprising: 

a speech recognition unit which converts to electronic speech data a voice 
command received through the microphone to select one of the plurality of images 
for viewing (Fig.1, items 4, 6, 8 and 7, Col. 5, lines 1-65, Col. 7, lines 45-55); and, 

a command and control processor for the electronic speech data received 
from said speech recognition unit, wherein said command and control processor 
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causes the selected image to be retrieved and displayed on the display monitor 
(Col. 9, line 39 - Col. 10, line 65). 

Dewaele while teaching displaying medical images in the field of Radiology, 
so that the attending physician can make his or her diagnosis and transcribed in 
response to voice commands recognized by the speech recognizer, do not 
specifically teach in the field of dentistry. It would have been obvious to one with 
ordinary skill in the art at the time of invention to implement the method and 
apparatus as taught by Dawaele in the medical field to the field of dentistry 
because, one would readily realize that by using speech recognition to display 
plurality of dental images would provide the hands free environment to the user 
and also have the data needed. 

As per claims 2-14, Dewaele teaches the method of claim 1, further 
comprising manipulation of images corresponding to a dental patient (Fig. 1 , items 
4, 6, 8 and 7, Col.5, line 1- Col. 6, line 6, Col. 7, lines 19-55, Col. 9, line 39 - 
Col. 10, line 65). 

Claims 1 5-1 6 are similar in scope and content of claim 1 , and are rejected 
under similar rationale. 

Claims 17-18 are method claims to be implemented on the apparatus 
claimed in claims 15-16, and are similar in scope and content, and are rejected 
under similar rationale. 
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Response to Arguments 



3. Applicant's arguments filed 1 1 /1 4/2002 have been fully considered but they 
are not persuasive. Applicants' argue that "Dawaele'257 does not disclose or 
suggest, however, retrieval for display of a computer stored dental (or medical) 
image based on a voice command to retrieve the image which is detected through 
speech recognition". Examiner disagrees. Dewaele does retrieve and use images 
stored in diagnosing or analyzing patient data (Col. 5, line 44 - Col. 6, line 6), and 
transcribing it. Dewaele identifies medical images through speech recognition by 
accessing them when needed from a storage database, identifiable using the 
patient's particulars. 

In response to applicant's argument that Dewaele '257 does not suggest 
that hands-free operation implemented through speech recognition minimizes the 
risks of contamination, a recitation of the intended use of the claimed invention 
must result in a structural difference between the claimed invention and the prior 
art in order to patentably distinguish the claimed invention from the prior art. If the 
prior art structure is capable of performing the intended use, then it meets the 
claim. In a claim drawn to a process of making, the intended use must result in a 
manipulative difference as compared to the prior art. See In re Casey, 152 
USPQ 235 (CCPA 1967) and In re Otto, 136 USPQ 458, 459 (CCPA 1963). 
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It would have been obvious to one with ordinary skill at the time of invention 
that the use of speech recognition to access patient data, or in the instant 
application, dental images in the form of data clearly shows the advantage of 
hands-free operation, whether for minimizing the risks of contamination or to 
facilitate the use of both hands to be free to do whatever the user wants to do 
with the hands, either by a dentist or a radiology technician or a physician. 

Conclusion 

4. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Lai et al., ("Medspeak : report creation with continuous speech recognition", 
Conference proceedings on Human factors in computing systems, 1 997, ACM 
Press, pages 431-438). 

Krapichler et al., ("Virtual reality and multimedia human-computer interaction in 
Medicine", 1998 IEEE Workshop on Multimedia Signal processing, pages 193-202). 
Guerrouad ("Voice control in the surgery room", Images of the Twenty-first 
century. Proceedings of the Annual International Conference of the IEEE 
Engineering in Medicine and Biology, vol.3, pages 904-905). 
Teel et al., ("Voice-enabled structured medical reporting", Conference on Human 
factors and computing systems, 1998, ACM Press, pages 595-602). 
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5. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of 
time policy as set forth in 37 CFR 1 .1 36(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1 .136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
the statutory period for reply expire later than SIX MONTHS from the mailing date 
of this final action. 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Vijay B. Chawan whose telephone number is 
(703) 305-3836. The examiner can normally be reached on Monday Through 
Thursday 7-4. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Marsha Banks-Harold can be reached on (703) 305-4379. 
The fax phone numbers for the organization where this application or proceeding is 
assigned are (703) 872-9314 for regular communications and (703) 872-9314 for 
After Final communications. 
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Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the receptionist whose telephone number is (703) 
305-4700. 

Vijay B. Chawan 
Primary Examiner 
Art Unit 2654 

vbc 

9 ~ 9nn o VUAYCHAWAN 
January zj, zvvj PRIMARY EXAfciNER 



Notice or Keterences uireu 


Application/Control No. 
09/924,831 


Applicant(s)/Patent Under 
Reexamination 
VOZICK ET AL. 


Examiner 
Vijay B. Chawan 


Art Unit 
2654 


Page 1 of 1 



* 




Document Number 
Country Code-Number-Kind Code 


Date 
MM-YYYY 


Name 


Classification 




A 


us- 










B 


us- 










C 


us- 










D 


us- 










E 


us- 










F 


us- 










G 


us- 










H 


us- 










I 


us- 










J 


us- 










K 


us- 










L 


us- 










M 


us- 












FOREIGN PATENT DOCUMENTS 


* 




Document Number 
Country Code-Number-Kind Code 


Date 
MM-YYYY 


Country 


Name 


Classification 




N 














0 














P 














Q 














R 














S 














T 














NON-PATENT DOCUMENTS 


★ 




Include as applicable: Author, Title Date, Publisher, Edition or Volume, Pertinent Pages) 




U 


Lai et al., ("Medspeak : report creation with continuous speech recognition", Conference proceedings on Human factors in 
computing systems, 1 997, ACM Press, pages 431-438). 




V 


Krapichler et aL, ("Virtual reality and multimedia human-computer interaction in Medicine", 1998 IEEE Workshop on Multimedia 
Signal processing, pages 193-202). 




w 


Guerrouad ("Voice control in the surgery room", Images of the Twenty-first century, Proceedings of the Annual International 
Conference of the IEEE Engineering in Medicine and Biology, vol.3, pages 904-905). 




X 


Teel et al., ("Voice-enabled structured medical reporting", Conference on Human factors and computing systems, 1998, ACM 
Press, pages 595-602). 



A copy of this reference is not being furnished with this Office action. (See MPEP § 707.05(a).) 
Dates in MM-YYYY format are publication dates. Classifications may be US or foreign. 



U.S. Patent and Trademark Office 
PTO-892 (Rev. 01-2001) 



Notice of References Cited 



Part of Paper No. 7 




United States Patent and Trademark Office 



UNITED STATES DEPARTMENT OF COMMERCE 

United Stoics Potent and Trademark Office 

Addrew: COMMISSIONER OF PATENTS AND TRADEMARKS 

Washington, D.C. 2tttUl 

www . tmpto.ituv 



FILING DATE 



FIRST NAMED INVENTOR 



I A TTORNEY DOCKET NO. | CONFIRMATION NO. 
65252 2876 



09/924,831 



08/08/2001 



David Vozick 



7590 



04/04/2003 



Norman H. Zivin 
Cooper & Dunham LLP 
1185 Avenue of the Americas 
New York, NY 10036 



EXAMINER 



CHAW AN, VIJAY B 



ART UNIT 



PAPER NUMBER 



2654 

DATE MAILED: 04/04/2003 



I/O 



Please find below and/or attached an Office communication concerning this application or proceeding. 



PTO-90C (Rev. 07-01) 



45 



~7 r- 



Advisory Action 



Application No. 

09/924,831 



Examiner 
Vijay B. Chawan 



Applicant(s) 
VOZICK ET AL. 



Art Unit 

2654 



„77»e MAILING DATE of this communication appears on the cover sheet with the correspondence address - 

THE REPLY FILED 28 March 2003 FAILS TO PLACE THIS APPLICATION IN CONDITION FOR ALLOWANCE. 
Therefore further action by the applicant is required to avoid abandonment of this application. A proper reply to a 
fSS^S^w^R\^^i onjy be either: (1 ) a timely filed amendment which paces the application in 
cSndffior ^anSSce; (2) a timely filedNotice of Appeal (with appeal fee); or (3) a timely filed Request for Continued 
Examination (RCE) in compliance with 37 CFR 1.114. ^ ^ 

PFRIOD FOR REPLY [check either a) or Wrf.fa^ Jj : j/^t/o b 

a) ffl The period for reply expires 3_months from the mailing date of the final rejection. * p ; 

b) □ The period for reply expires on: (1) the mailing date of this Advisory Action, or (2) the date set forth in the final rejection, whichever is later. In 

no event however will ithe statutory period for reply expire later than SIX MONTHS from the mailing d^tf the ftaal^on. 
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ee under 37 CFR1 1 7(a) is calculated from: (1) the expiration date of the shortened statutory period for reply originally set ,n ttw final Office action; or 
(1) as sel forth in (bj above, if checked. Any reply received by the Office later than three months after the mailing date of the final rejection, even if 
timely filed, may reduce any earned patent term adjustment. See 37 CFR 1.704(b). 

1 □ A Notice of Appeal was filed on . Appellant's Brief must be filed within the period set forth in 

37 CFR 1 .1 92(a), or any extension thereof (37 CFR 1 .191 (d)). to avoid dismissal of the appeal. 
2.D The proposed amendment(s) will not be entered because: 

(a) □ they raise new issues that would require further consideration and/or search (see NOTE below); 

(b) □ they raise the issue of new matter (see Note below); 

(c) □ they are not deemed to place the application in better form for appeal by materially reducing or simplifying the 

issues for appeal; and/or 

(d) □ they present additional claims without canceling a corresponding number of finally rejected claims. 

NOTE: . 



3-D Applicant's reply has overcome the following rejection(s): 



would be allowable if submitted in a separate, timely filed amendment 



4. D Newly proposed or amended claim(s) _ 

canceling the non-allowable claim(s). 

5. E The a)D affidavit, b)D exhibit, or c)S request for reconsideration has been considered but does NOT place the 

application in condition for allowance because: See Continuation Sheet. 

6. D The affidavit or exhibit will NOT be considered because it is not directed SOLELY to issues which were newly 

raised by the Examiner in the final rejection. 

7. D For purposes of Appeal, the proposed amendment(s) a)Q will not be entered or b)D will be entered and an 

explanation of how the new or amended claims would be rejected is provided below or appended. 

The status of the claim(s) is (or will be) as follows: 

Claim(s) allowed: . 

Claim(s) objected to: . 

Claim(s) rejected: . 



Claim(s) withdrawn from consideration 



8. Q The proposed drawing correction filed on is a)D approved or b)D disapproved by the Examiner. 

9. Q Note the attached Information Disclosure Statement(s)( PTO-1449) Paper No(s). — 

10. Q Other: 
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1. An apparatus for hands-free command and control of a 
dental ' imaging system having a display monitor, a microphone and 
a storage device storing a plurality of dental images 
corresponding to a selected dental patient, comprising: 

a speech recognition unit which converts to electronic 
speech data a voice command received through the microphone to 
select one of the plurality of dental images for viewing; and 

a command and control processor for the electronic speech 
data received from said speech recognition unit, wherein said 
command and control processor causes the selected dental image 
to be retrieved from the storage device and then displayed on 
the display monitor. 

2. The apparatus of claim 1, wherein thumbnail 
representations of the plurality of dental images corresponding 
to the selected dental patient are displayed for selection by 
the user. 

3. The apparatus of claim 1, wherein the plurality of 
dental images include intra -oral images, panoramic dental 
images, FOTI images and periodontic images. 
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4. The apparatus of claim 1, wherein text, audio and video 
data are also stored in the storage device and available for 
selection to be displayed. 

5. The apparatus of claim 1, wherein the dental images are 
acquired from one of a dental computer connected device, video 
camera, digital scanner or X-ray storage device and stored in 
the storage device. 

6. The apparatus of claim 1, wherein the storage device is 
connected to a computer network. 

7. The apparatus of claim 1, wherein the storage device is 
remotely located and connected through a network. 

8. The apparatus of claim 1, wherein the command and 
control processor is remotely located and connected through a 
network. 



9. The apparatus of claim 1, wherein the microphone is 
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wireless . 

10. The apparatus of claim 1, wherein after the selected 
dental image is retrieved from the storage device and displayed 
on the display monitor, the command and control processor, in 
response to a second voice command received through the 
microphone and converted by said speech recognition unit, causes 
the selected dental image to be further processed according to 
the second voice command. 

11. The apparatus of claim 1, wherein after the selected 
dental image is retrieved from the storage device and displayed 
on the display monitor, the command and control processor causes 
a voice interface through a speaker to provide a set of options, 
for selection by a user, for further processing the selected 
dental image . 

12. The apparatus of claim 1, wherein the command and 
control processor causes a voice interface through a speaker to 
provide, a voice prompt to guide a user through selection of an 
appropriate dental image. 
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13. The apparatus of claim 1, wherein the speech 
recognition unit includes a hardware module electronically 
coupled to the command and control processor. 

14. The apparatus of claim 1, wherein the speech 
recognition unit comprises a client-server speech recognition 
system. 

15. A dental imaging system, comprising: 
a microphone; 

a display monitor; 

a storage device, wherein the storage device stores a 
plurality of dental images corresponding to a selected dental 
patient; and 

a speech recognition command unit which converts to 
electronic speech data a voice command received through said 
microphone to select one of the plurality of dental images for 
viewing, and processes the electronic speech data to cause the 
selected dental image to be retrieved from said storage device 
and then displayed on said display monitor. 
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16. The system of claim 15, wherein the microphone is 
wireless . 

17. A method of hands-free command and control of a dental 
imaging system, comprising the steps of: 

converting to electronic speech data a voice command from a 
user through a microphone to select for viewing one of a 
plurality of dental images stored in a storage device for a 
selected dental patient; and 

processing the electronic speech data to cause the selected 
dental image to be retrieved from the storage device and then 
displayed on a display monitor. 

18. The method of claim 17, wherein the microphone is 
wireless . 
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ABSTRACT 



An identification station into which data identifying a medi- 
cal image are input and by means of which the identification 
data are associated with the medical image, is provided with 
a speech recognition subassembly and a microphone to 
allow data input through speech recognition. 

5 Claims, 2 Drawing Sheets 
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IDENTIFICATION OF MEDICAL IMAGES 
THROUGH SPEECH RECOGNITION 

The application claims the benefit of U.S. Provisional 
Application No. 60/045,873 filed May 7, 1997. 5 

DESCRIPTION 

1. Field of the Invention 

The present invention is in the field of medical imaging. ^ 
The invention relates to identification of medical images, 
more specifically of radiographic images. 

2. Description of Prior Art 

When a medical image of a patient is to be produced, a 
number of u nification data are to b e associated with said 
image. AmonTsuch data the most relevant are the data 
T rtentifvinP the patient to which the irf}??* iw.riains and the 
TTaTalacmifring the examination type that is performed or is 
going to be performed. Other data that are commonly 
"associated with a medical image are the name of the 20 
radiologist, the sex of the patient etc. 

It is nowadays practice to enter a patient's identification 
data into a data base, commonly called a hospital informa- 
tion system (HIS). At a subsequent visit of the patient, the 
data are retrieved from the hospital information system and 25 
completed. 

In most cases the data entry consists of filling out elec- 
tronic forms displayed on computer screens. 

The current way in which this data entry is performed 
requires keyboard input or item selection via cursor control 30 
keys. This way of operating is inevitably slow, requires 
correction and may therefore potentially slow down work- 
flow. Even for experienced operators it is impossible to enter 
more than 25 to 30 words a minute. ^ 

The problem becomes more severe when a mobile iden- 
tification apparatus is used, where keyboard entry is unat- 
tractive for additional reasons such as the fact that the 
mobile identification devices have too small a size to port a 
normal-size keyboard. So, small keyboards are used having ^ 
buttons that are too small to allow normal typing speed. 
Additionally, the key order is in most cases different from 
the key order on a standard keyboard. Further, the screen 
size is small so that an awkward user interface navigation is 
provoked. 45 

Mobile identification apparatuses include hand-held ter- 
minals such as PSION Workabout from Psion Ltd., palmtop 
computers and personal digital assistants. The latter devices 
sometimes feature pen input capability combined with hand- 
written recognition instead of keyboard entry. 5C 
Unfortunately, no 100-percent error free recognition is cur- 
rently available, requiring diflficult-to-operate correction 
means. Furthermore, its data input speed still remains slow. 

Mouse or trackball, another frequently employed means 
to select items on a graphical user interface, are sometimes 55 
available on portable data terminals but are awkward to 
handle during mobile operation. 

A specific medical radiographic imaging technique rap- 
idly gaining importance is digital storage phosphor radiog- 
raphy. According to this technique a radiation image, for 60 
example an X-ray image of an object, is stored in a screen 
comprising a photostimulable phosphor such as one of the 
phosphors described in European patent application 503 
702. 

In a read out station the stored radiation image is read by 65 
line- wise scanning the screen with stimulating radiation 
such as laser light of the appropriate wavelength, detecting 
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the light emitted upon stimulation and converting the emit- 
ted light into a digital signal representation that can be 
subjected to different kinds of image processing techniques. 

The original or enhanced image can then be transmitted to 
a hard copy recorder for reproduction of the image on the 
film size and lay-out of the radiologist's choice and/or it can 
be applied to a monitor for display. 

After read-out the residual image left on the photostimu- 
lable phosphor screen is erased so that the screen is again 
available for exposure. 

As in conventional radiography the radiographic image 
needs to be associated with a patient. 

Further, adjustment parameters for the components of the 
read out device as well as parameters to be used during 
image processing are to be associated with a radiographic 
image. Commonly the settings for the read out apparatus and 
the processing parameters are determined by associating 
with an X-ray image an identifier of the performed exami- 
nation type. With this examination type a unique set of read 
out settings and processing parameters is linked. This set is 
defined and stored (in the read out apparatus) in advance. 

The currently used patient and examination type identi- 
fication system operates as follows. An unexposed photo- 
stimulable phosphor screen is conveyed in a cassette that is 
provided with an EEPROM having a number of electrical 
contacts in a fixed position on the cassette for power supp^^ 
and read-write transfer of identification data. The radiologist 
or operator performs a radiographic exposure of a phosphor 
screen in a cassette and transports the exposed cassette to an 
identification station. The identification data of the patient 
are entered into an identification program running on the__ 
identification station. This can be performed manually by 
entering the data into a personal computer of the identifi- 
cation system via keyboard entry. 

Alternatively, in case the identification station is con- 
nected to a hospital information system (HIS) or a radiology 
information system (RIS), the identification data can be 
retrieved from that information system. 

An examination type identifier is entered manually into 
the identification station by selecting a specific examination 
type (and subtype) out of a hierarchically popped up menu. 

Then, the patient identification data and the examination 
type identifier are written into the EEPROM on the exposed 
cassette by means of dedicated hardware linked to the 
identification station's personal computer. Further details on 
this procedure as well as on the outlook of the cassette are 
described in U.S. Pat. No. 4,960,994. 

The exposed and identified cassette is then fed into a read 
out station that is provided with means for reading out the 
data stored in the EEPROM and for storing these data in a 
central memory and with means for reading the radiographic 
image stored in the photostimulable phosphor screen. 

The examination type read out of the EEPROM controls 
selection of corresponding parameters for set up of the read 
out electronics as well as for the image processing to be 
performed on the read out image. These parameters were 
stored in advance in a look up table in the memory of the 
read out apparatus following a customization procedure as 
has been described in European patent application 0 679 
909. Next, variable contents of the EEPROM are erased 
whereas fixed contents are kept or updated. 

The image in the screen is read out and subjected to 
processing taking into account the read -out settings and the 
processing parameters corresponding with the identified 
examination type. 
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Alternatives to the above method have been developed A speech recognition subassembly commonly comprises: 

and are described in European patent application 0 727 696. an input for a microphone (e.g. for a condenser or 

In this patent application several embodiments of patient dynamic microphone), 

identification means such as a bar code label, a radio- an analog-to-digital converter for converting data sup- 

frequency tag, a touch memory or an EEPROM device have 5 p ]j e( ] v j a me microphone input, 

been described. A read/write terminal which is preferably a a CpT j ( an g bit microcontroller such as an Intel 8051 or 

mobile hand-held terminal is used to read the information in aQ Inld gggg can perform the task. Evidently, more 

the patient identification means and to transfer this infor- performant microprocessors can also be used), 

mation to a radio-frequency tag provided on a cassette processing means for processing data converted by the 

conveying a photostimulable phosphor screen. io analog-to-digital converter, such as a dedicated DSP 

The information stored in the different embodiments of processor (e.g. selected from the Texas TMS 320 series 

the patient identification means is either retrieved from a or AD 21 series or Motorola 56xxx or 88xxx series 

data base or manually, i.e. via keyboard entry, entered into etc ^ 

a computer and transferred from the computer, to a bar code mem ory means for data and program storage, for example 

printer or to a RF tag, or a touch memory. 1 a RQM memor y for program storage and a RAM 

Although these alternatives provide more freedom of memory for data storage, 

operation to the operator who needs to perform the identi- a powef supp j^ 

fication of a medical image, all embodiments require key- ^ mlerfacmg means such ^ a RS 2 32 connection, 

board entry at some point during the identification procedure Preferably a signa l conditioning means (this is an elec- 

and hence suffer from the already menuoned drawbacks ^ ^ sigQal amplification etc .) is 

such as low speed, correction requirement, difficult handlmg provided fof cond itioning the signal that is supplied via the 

etc - microphone input. 

OBJECTS OF THE INVENTION In one embodiment the identification station is also pro- 

-25 vided with a voice synthesis subassembly and a speaker for 

It is thus an object of the invention to provide an identi- providing auditive responses to the operator. Such an assem- 

fication station for identifying a medical image and an additionally comprises a digital to analog convenor, an 

identification method that is fast and reliable and allows tor amplifierj a speaker output and a RAM memory for storing 

handsfree operation. voice sam ples. 

It is a further object of an embodiment of the invention to 3Q Speecn recognition technology has reached the point 

provide such an identification station and such an identifi- WQere affordable commercial speech products arc available 

cation method that are adapted for use in the field of storage for desklop sys tems (see "PDAs and Speech Recognition" in 

phosphor imaging wherein an image is stored on a photo- ^drew Seybold's Outlook on Communications and 

slimulable phosphor screen conveyed in a cassette compos- Computing, Vol. 14, No. 10, pp. 9-12). 

ing a cassette identifying means such as an electronic ^ D ala enlr y spe ed is much higher than keyboard typing and 

memory. handwritten recognition. It further allows hand-free and 

Still further objects will become apparent from the eyes-free operation of the identification equipment enabling 

description hereafter. the operator to freely communicate without having to have 

„ tVTT ^ XTTIAVr physical contact with identification system for controlling 

STATEMENT OF THE INVENTION ^ PjJ^ q{ ^ ^ ^ of ft _ wilh speech 

To achieve .the above objectives the present invention synthesis or recall of previously recorded speech samples, 

provides an identification station (1) comprising means (4,5) speech technology thus enables two-way system interaction 

for entering data identifying a medical image and means solely by means of voice. 

(6,18) for associating data with the medical image, charac- Algorithmic advances and DSP (digital signal processing) 
terised in that said means (4,5) for entering data are means 45 implementation now provide means for implementing the 
for entering data through voice recognition. required voice processing on reasonable cost and reasonable 
Another aspect of this invention relates to a method of power platforms while maintaining the required accuracy for 
identifying a medical image comprising the steps of the application, 
entering identification data of said medical image into an Companies offering desktop continuous speech recogm- 
identification station, 50 tion hardware and software, include Dragon Systems in the 
associating said identification data with said medical U.S.A. and Urnout & Hauspie in ™V™to™m£of 
image, characterised in that said identification data are a speech recognition subassembly is the STAR21 stand- 
entered bv speech alone board from 1x1110,11 & Hauspie Speech ProducLs - 11 1S 
An identification station commonly comprises a personal a low cost and complexity product featunng ar .input for 
computer or a workstation running an identification pro- 55 condenser microphone, an Analog Devices AD2 1 msp58 
gram P . I. can be a stand alone station or a station that is DSP 12 Mhz signal 1 processor, SRAM ^ 
connected to a network and that provides access to a hospital program and speech model storage and RS232 co nnechon to 
information system or a radiology information system. For a host. Products designed for small hand-held devices are 
n case of manipulation in a hospital environment the offered by companies such as Advanced Recogmtion Tech- 
dentification station is preferably a portable read/write 60 nologies (ART). The SmartSpeak product of ART £ a 
5,,,i 0 „ low-cost voice recognition software package, which is inte- 
* The identification station according to the present inven- grated on a board featuring a m iCT °P b ° ne f ***J> 
tion is equipped to provide data input through voice recog- converter, a 8051 microcontroller, RAM and ROM memory 

nition and a serial RS232 in,erface - 

For this purpose the identification station comprises a 65 A strong prejudice has existed against the application of 

speech recognition subassembly and a microphone con- data input via speech for identification purposes. Speech 

nected to thfa subassembly. recognition is difficult primarily because of variability, 
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. nl fnrm _ . /i \ variability of sounds command and control words to navigate the user interface of 

restricted to a vocabulary of at most 100 smgle and (Wte* P?g« s ™ la } 0U l pa ameters (letters or di^ts). 

isolated words so that the variability of sounds ,s 10 ^^^Jf J^^ y ^ P lhe numbe r of 

limited - . . ... „ c ,„ nossible words to be recognized. The combinations of 

(2) ™*du«^ I SmLtion and sub-examination stnngs can easily exceed 
signal characterisation is limited since the input is 2Q0 However> enlty of the examination type constrains the 
always via microphone, the characteristics of which are ^ ibmiics of the su b-examination types to be 
known at design stage. Thus, the voice recognition 1$ ^ ^ l0 the ^ of sub-examinations belonging to the 
system need not be able to cope With a variety ot exarnmat i on cl ass just recognized, thereby minimizing false 
sources. recognition. 

(3) the risk of interference with background noise from Iq g CTJCTa \ y some form of task constraints in the form of 
extraneous speech or transient acoustic events is lim- formal syntax (defining which words can follow other words 
ited on a radiology department since the voice input is 20 m different contexts of the identification flow) and formal 
under software control of the application and is semantics (defining which words make sense in the current 
restricted to well defined time slots in the course of status 0 f (ri e identification operation) make the recognition 
operation. task more manageable. 

Significant advances in several technologies and applica- jhe limited size of the vocabulary to be recognized for the 

tion areas pertinent to voice processing have made feasible 25 ra diology identification task enables one to customize the 

automatic voice recognition, such as (1) smart microphones vocabulary as to language and operator. This feature is 

adapting to any acoustic environment and giving optimum implemented in a straightforward way by letting the system 

signal-to-noise ratio in noisy backgrounds (2) acoustic echo switch to the appropriate set of stored reference voice 

cancellation to provide echo-free communications (3) patterns whenever the operator identifies himself to the 

advances in algorithms and DSP implementation of these 30 identification system, either upon entry of the operator's 

algorithms providing high performance on reasonable cost name or by automatic speaker recognition of an utterance of 

platform. Although the sources of variability cannot be me operator's name. 

eliminated in general, speech recognition technology has ^ identification station according to the present inven- 

reached a point to model and handle them properly. These tion bas been designed in particular for use in connection 

models are based on (1) standard pattern recognition or (2) 35 w j t h a system wherein a medical image is stored in a 

on hidden Markov models. The first class computes a best photostimulable phosphor screen. 

match similarity score between a spectral pattern of features However, it can be applied in connection with imaging 

against a database of stored vocabulary patterns. These systems comprising other means for storing medical images 

spectral patterns model differences across different speakers sucn as radiographic film. 

and variance statistics derived over the time duration of the 40 Photostimulable phosphor screens are conventionally 

word. The second class of models calculates the highest conveyed in a cassette. In one embodiment such a cassette 

likelihood score for a probabilistic model for each word of ^ pr0 vided with a cassette identifying means, for example 

a vocabulary of words. an electronic memory device. Data identifying the medical 

Voice processing has proven to be very well suited for the j magc are me n input to an identification station according to 

purpose of identification in a hospital environment or spe- 45 tnc pre sent invention and are then transferred from the 

cifically in a radiology department for the following reasons. identification station to the memory on the cassette. 

First, the speaking format, that is the mode of speaking to Although the cassette identifying means may take differ- 

the machine has limited complexity : it will basically fall cnt forms ( e . g . bar code label), an electronic memory is very 

into one of the following categories: useful because of its storage capacity, its ability to be 

(a) isolated word recognition (each spoken command or 50 re-used, etc. A cassette for conveying a storage phosphor, 
data entity represents one single word) or comprising a memory device has been described in Euro- 

(b) connected word mode (the operator uses fluent speech pean Patent application 0 307 760. 

but with highly constrained vocabulary) or Various forms of electronic memory devices exist such as 

(c) continuous speech mode (the operator dictates phrases galvanically connectable EEPROM touch memory etc. 
^^HHiimuu l dial x 55 Devices that permit transfer of data and/or energy by 
Tl£ tat mode is suited for control and command entry radio-frequency transmission are preferred because these 

and for input of single word data, the second mode is suited devices allow identification without the need for physical 

for entry of lcttcrf of the alphabet or digits. The third connection between the identification device and the cas- 

ScJ of speaking format b continuous speech and is sette. This kind of devices is ^rtbermore very well adapted, 

applicable for voice entry of comment-like annotations or 60 for use with a mobile identification apparatus. 

cSproioeols to a patient's identification records. A device that is very well suited for such an appkcaUon 

A second reason why voice processing is well suited for is a radio-frequency tag (alternatively termed radio- 

identificationof medical images is that the degree of speaker frequency transponder). Identification procedures based on 

dependenceislow.sincethenumberofoperatorsistypicaUy the use of radio-frequency tags have been described in 

low and almost fixed over time. 65 European patent application 0 727 696 

Alhird reason is that the vocabulary size and complexity In case a radio-frequency lag is used, the identification 

is low to moderate. It will typically consist of a set of station needs to be equipped with means for transferring 
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identification data to said memory by radio -frequency trans- 11 directing hght emitted by a stimulate phosphor screen 

mission Additionally the identification station may be into the light input face of a photomultipher 12, a sample and 

equipped with means for transferring supply voltage to said hold circuit 13, and an analog to digital convenor 14. The 

memory by radio-frequcnev transmission. read out device also comprises a processing module (no 

J 5 shown) for performing online processing on the digital 

BRIEF DESCRIPTION OF THE DRAWINGS s i gna i representation of the radiation image. 

Particular aspects of the present invention as well as The operation of the read out station is as follows, 

preferred embodiments thereof will be explained by means Stimulating rays emitted by laser 15 are directed onto the 

or the corresponding drawings wherein pbotostimulable phosphor screen to scan this screen^ The 

. , . ? . u-,h «h, m ,.hnH 10 stimulating rays are deflected into the main scanning direc- 

FIG. 1 is a general v.ew ot a system in which the method SJ^ deflection means 16 . Sub . 

of the present invention can be applied ^ performcd by lransporting thc phosphor screen 
FIG. 2 is a detailed view of a system for reading an image in ^ subscanning direction indicated by arrow 17. Upon 
stored in a photostimulablc phosphor screen. stimulation, the photostimulable phosphor emits light within 
ncxATTPn np^rRTPTION 15 a second wavelength range which is different from the 
UbiAiLtu utauKir nyi wavelength range of the stimulation light. The emitted light 
A simplified diagram of a system in which the present ^ directed by means of a light collector 11 onto a photo- 
invention can be implemented, is shown in FIG. 1. multiplier 12 for conversion into an electrical image rcprc- 
Tht described system is a digital radiography system scntation. Next, the signal is sampled by a sample and hold 
wherein a radiographic image is recorded on a pholostimu- 20 circuit 13, and converted into a digital raw image signal by 
lable phosphor screen. The photostimulable phosphor screen means of an analog to digital convenor 14. The digital signal 
is conveyed in a cassette 7. The cassette is provided with a representation of the radiation image is then fed into pro- 
radio-frequency tag 8 in which identification data, i.a. data cessing module (not shown) where it is subjected to image 
concerning a patient that is subjected to a radiograpnic enhancing signal processing techniques. 
"p.Yfl minatjon and conccminp the tvoc ol ttxafflmalion Ihat is 2 5 Workflow Description 

^rmpH ".!^ am stored. The following is a description of the workflow from the 
The system comprises an identification station 1, a read identification of a radiation image pertaining to a radio- 
out station 2 in which the image stored in thc photostimu- graphic examination of a patient to the read out of the digital 
lable phosphor screen is read out and digitized and wherein image representation, 
the digital signal representation of the radiographic image is 30 FIRST EMBODIMENT 
subjected to image processing. A laser recorder 3 is provided 

for reproducing the read out image. Stationary Operation 

The system shown in FIG 1 can be expanded to include A[ t ^ slandardized dala 

other stations such as a workstation for ^tonmag ; off-line ^ ^ ^ ^ tQ ^ 

processing on the digital representation of the radiographic * y P involved hospital entities with requested patient 

image and/or for performing soft copy diagnosis. However, 4 J ^ ^ ^ dectr0Qic forms 

since these additional components are not relevant m the q{ &n idenlification station . ^ Wnd 

context of the present invention, they will not be described of PJ fe ^ ^ ^ generaUy performed by a 

in detail. 40 sma jj staff of peop i c wbo train the system to recognize 

The identification station 1 consists of a personal com- individua i worc j patterns. The task is also characterised in 

puter (or alternatively a workstation) which is in the ^ cnces of keystrokes can be replaced with a single 

described embodiment connected to a network so as to voice command or a voice macro and i t ^ thus a task that is 

provide access to a hospital information system (HIS) or a ^ suited tQ be handled by voice pr0C essing. 

radiology information system .9 (RIS). 45 ^ commonly al lhe patient recep- 

Tlie identification station is further equipped with a {[Qn desk fe ^ of accessing a database such as a RIS or 

speech recognition/synthesis subassembly 4, with a dynamic H]S ^ recogni{ioD tas k then consists of querying a 

microphone input 5 to provide data input via speech and a database l0 determine specific information concerning the 

speaker 10 to provide auditive responses. An example of a t comained within lne database . 

suitable speech recognition subassembly is la f™™°™ 50 ^ followin actions m at patient intake, the 

board Star 21 of Urnout and Hauspie B elff^^J * ^ aimed at cnabUag the 

=^ - «T*rxsx- based idenlificalion operallon in 

speech output power supply. 55 I ' lnformation System) or .HIS (Hospital 

•Hie personal computer (or workstation) * prided I with Urmilton System) by an employee of the adminis- 

a readme sub-umt 18 and an an enna 6 ^o»»^ tfative department 0 ' r Sieved by database query and 

steering electronics (not shown) for transferring data to an brou aht uo to date- 

RF lac. Additionally, a link to a bar code printer, or to a r i . , . t . , . , , , 

touch probe may be provided. Thc selection of probes or 60 (b) lhe list of currently residing patients is updated; 

connections that is provided depends on the mode of opera- (c) a voice sample of the name of the patient is uttered by 

lion .chosen by a specific hospital. lhe employee and stored along with the mdcx/patienl 

The read out station is illustrated in FIG. 2 and comprises hsl; 

a laser 15 emitting light of a wavelength adapted to the (d) patient or examination specific annotations are entered 

stimulation spectrum of the phosphor used, galvanometric 65 by voice and stored in the patient's records so as lo be 

light deflection means 16 for deflecting light emitted by the recalled by voice synthesis. To the purpose of voice 

laser onto the photostimulable phosphor screen, a light guide recognition in the ART system the voice sample is 
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digitized in the acquisition phrase by an A/D convenor, 
as small as 6 bits, and compressed into a package as 
small as 200 bytes on the average per second of 
analyzed signal, and stored in memory. Therefore, the 
RAM storage requirement does not exceed 100 KByte 5 
for 500 isolated words. The 200 Byte package is a 
cum pressed signature in vector form capturing the 
features that make a particular sound-bite unique. In the 
recognition phrase, these vectors are compared by the 
recognition engine with an input voice sample that is 10 
similarly digitized and compressed, 
(e) the patient is optionally provided with a personal 
identification data carrier such as a barcode, encoding 
the patient index, or an EEPROM based data carrier 
such as a Touch Memory or an RF-tag. 15 
Patient exposure. The cassette conveying a pootostimu- 
lable phosphor screen is exposed at an examination site by 
a radiology operator or a physician. The cassette is provided 
with an EEPROM based data carrier. In this embodiment the 
data carrier is a RF tag (radio-frequency tag). Information 2 o 
can be written onto and read from a RF tag without requiring 
mechanical contact. 

Cassette identification. The exposed cassette is then trans- 
ferred to identification station 1. The identification station 
consists of a networked personal computer, a read/write 25 
identification subassembly (6,7) to write and read data to and 
from the identification carrier of an introduced cassette and 
a speech recognition subassembly (4,5) with microphone 

input (5). . . 

The design of the identification station shown in FIG. 1 is 30 
only one example. Alternative designs arc possible. The 
apparatus may for example be provided with a slit wherein 
a cassette can be slided so that the radio-frequency tag is 
optimally positioned for wireless data (and energy) transfer. 
The speech recognition subassembly can either be integrated 35 
on a stand-alone board separately powered and connected to 
the identification station by serial link or it can be integrated 
on a plug in board in the identification station. 

The following is a description of operations performed, 
along with details pertinent to the voice recognition func- 40 
lionalily: 

A radiologist specific identification-screen is popped up 
either by sensing an operator's personal identification 
carrier to the read/write identification subsystem or by 
voice recognition of an utterance of the operator's 45 
name by the speech recognition subassembly. The 
database of voice patterns pertaining to the operator is 
made active. 

The patient's name is uttered by the operator to identify 
the patient to the system. On correct recognition, the 50 
name is displayed in the patient name field. On false 
recognition, an alternative voice input is offered con- 
sisting of spelling the patient's name. During utterance 
of the letters of the name, the list of patients currently 
residing in the hospital as established during patient 55 
intake, is popped up onto the screen. The portion of the 
list displayed during spelling is continuously narrowed 
as more successive letters are recognized by the system. 
In addition to the patient name, the list also shows the 
running number of the patient in the list and the 60 
patient's birthday. At all times during spelling the 
name, a shortening may be obtained by uttering the 
digits of the running number of the patient as soon as 
the data searched for become displayed. Both spelling 
of 26 letters of the alphabet and the 10 digits is far less 65 
prone to recognition error than direct recognition of the 
patient's name, for reasons that the vocabulary of 
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letters and digits has fixed size and can be specifically 
trained to the operator. In contrast, direct recognition of 
the patient's name is more difficult since the number of 
words is substantially large (as large as 500 e.g.) and 
since the voice sample of the name used as a reference 
template, has been recorded by a receptionist at patient 
intake. This person in general is different from the 
radiology operator, and patient name recognition thus 
has presented itself as a speaker independent recogni- 
tion task. An acceptance qualifier completes the patient 
entry; a correction qualifiers offers the operator the 
opportunity to re-enter a name; a rub-out qualifier 
enables to erase letters in much the same way as the 
backspace key on a keyboard operates. As a fallback 
way of entry, the patient name may still be selected by 
cursor movement from the patient list or entered manu- 
ally by keyboard on network failure or absence of a RIS 
database. The patient name is filled in in its appropriate 
field, and other patient related data are retrieved from 
the RIS database to complete fields such as sex (M/F) 
and birthday. Should these latter items be unavailable, 
voice entry of them is task of recognition of a sequence 
of letters and digits. 
The system prompts the operator to input the examination 
type. The examination type is one out of a radiologist 
specific list of examination (such as thorax, pelvis, 
skull, . . . ) and recognition thus belongs to the isolated 
word mode. The size of the examination list typically 
does not exceed 20. On correct recognition, the exami- 
nation type is automatically entered into the appropriate 
field. On false recognition, a list of all examination 
types and a ranking number is popped up to assist the 
operator in selecting the examination type. Selection 
now is done by uttering the digits (one or two digits) of 
the ranking number. Alternatively, the user may use 
cursor movements to scroll through the list and the 
'enter' button to select. 
The system then prompts the operator to input the sub- 
examination type. The sub-examination type is one out 
of a radiologist specific list of sub-examinations (e.g. 
'lateral', 'frontal', . . . ), pertaining to the examination 
type just selected. The size of the sub-examination list 
typically does not exceed 25 per examination, still 
amounting to a total number of sub-examinations as 
large as 500. However, knowledge of the examination 
type restricts the number of valid choices for the 
sub-examination in that sub-examination of other 
examination classes are not taken into consideration. 
This makes the recognition of the sub-examination 
more manageable. Analogously, on correct recognition, 
the sub -examination type is automatically entered into 
its field. On false recognition, a list of all examination 
types and a ranking number is popped up to assist the 
operator in selecting the sub-examination type by utter- 
ance of the corresponding digit sequence. 
Examination and sub-examination determine layout 
parameters as to how the image will be processed, 
printed and displayed (these include patient position, 
cassette position and exposure class). These parameters 
are retrieved from radiologist specific internal data 
buffers and are automatically filled out in their appro- 
priate fields. Should these fields be modified, the opera- 
tor issues voice commands as to the placement of the 
cursor in one of these fields and modifies the default 
entry. 

The system prompts the operator to input the destination 
type. The destination type is one out of a radiologist 
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specific list of preferred hardcopy and softcopy devices 
to send the digitized image to. The list typically con- 
tains smaller than 10 items. Selection proceeds in a way 
similar to that of the examination and sub-examination 
entry. Next, the number of copies on a hardcopy unit is 
entered by voice. 

Optionally, the operator may enter comments in the 'user 
info' field as a recorded voice stream upon issuing the 
request "info". Voice data is stored along with other 
identification data in a database. 

On completion of all fields on the identification screen, 
the system prompts the operator to write the data to the 
cassette identification carrier by means of the Read/ 
Write subassembly on recognition of the action word 
"write" or other meaningful answers such as "OK" or 
"Yes". 

A typical voice based identification session is the follow- 
ing sequence 



Identification System 


Operator 


"Please enter operator identification" 


"Operator Johnston" 


"Enter patient" 


"Smith" 


"Unrecognized. Please spell" 


"S", "M" 


(patient list pops up, patient Smith has 


"five", "four" 


number 54) 




"Enter examination" 


"thorax" 


"Enter sub-examination" 


"lateral" 


"Enter destination" 


"list" 


(list is popped up, LR_3 device has number 


"three" 


3) 


"two" 


"Number of copies" 


"Accept and write data?" 


"OK" 
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Identification data that were input in the identification 35 
station and an energy signal for powering the radiofrequency 
tag on the cassette are transferred through radio-frequency 
transmission onto the radio-frequency tag provided on the 
cassette. The identification procedure is now terminated. 

Digitization. After identification, the cassette is with- 4Q 
drawn from identification station 1 and entered into read out 
apparatus 2. The identification data are read out from the 
radio-frequency tag on the cassette and used for processing 
the image according to specific image processing parameters 
pertaining to the examination type. 45 

Should demographic data be unavailable on the cassette 
id -data carrier, all unknown fields are retrieved from the 
RIS/H1S database by patient record lookup. 

Hardcopy/Softcopy. Patient demographic data, examina- 
tion processing settings and radiologist name are sent along 50 
with the image to the hardcopy unit or transmitted to a 
softcopy diagnostic unit. 

SECOND EMBODIMENT 

Mobile Operation 55 

Mobile identification offers the advantage over stationary 
identification in that the identification can be performed at 
the examination site. This is particularly advantageous for 
intensive care units (ICUs) and bedside examinations (e.g. 
thorax at bed) because it considerably reduces the risk of 60 
misidentification. 

However, the operator carries both a portable identifica- 
tion terminal and one or more cassettes, and thus faces a 
manipulation problem, in addition to the problems outline 
before. Voice based data entry enables him a hands and eyes 65 
free mobile identification operation, the details of which are 
disclosed below. 



For the purpose of mobile identification, a handheld 
computer such as Psion Workabout from Psion Ltd., U.K. is 
equipped with peripherals as described in "Psion 
Workabout, Products & Markets document", such as a 
barcode scanner, a custom designed Touch Memory module 
to write/read Touch Memory buttons from Dallas 
Semiconductor, USA, and/or a custom designed RF-tag 
write/read subunit to write/read RF-tags from MIKRON 
GmbH, Austria. The terminal is equipped with microphone, 
A/D converter, microcontroller and voice recognition soft- 
ware such as SmartSpeak available from Advanced Recog- 
nition Technologies Inc., USA. The mobile identification 
modality further comprises a network of docking stations, 
connected to a host in a serial multidrop network via RS485 
or in another common network standard such as Ethernet. 
The host runs the communication software to communicate 
with the handhelds. A mobile identification session proceeds 
in much the same way as a stationary identification opera- 
tion: 

at regular time intervals an updated patient list annotated 
with patient index and a 200 byte voice sample of the 
patient name is communicated across the cradle net- 
work to all mobile terminals. Alternatively, at all times, 
the most recent list can be retrieved on request of the 
operator by a key sequence. 

The radiology operator picks up a terminal, and identifies 
himself to the system, by reading the operator's iden- 
tification means. 

Patient identification is done either by scanning the 
patient's barcode holding the patient index or by voice 
input of the patient's name. Analogous to the stationary 
identification, a similarity score between a compressed 
version of the operator's utterance of the patient name 
and all 200 Byte voice compressed samples, attached to 
the patient name is computed, and the most similar 
match determines the patient name presented to the 
operator. Should verification reveal incorrect 
identification, the patient name is spelled and a list 
narrows until no more than one patient name corre- 
sponds to the sequence of uttered letters. Again, such a 
task is much less error prone, since it represents a fixed 
and limited vocabulary recognition task. 

Examination, sub-examination and destination are recog- 
nized and entered to the system by a procedure analo- 
gous to the stationary identification. 

The cassette is identified by writing all identification data 
to the cassette's identification carrier by means of a 
read/write subunit of the portable terminal, e.g. a 
RF-tag module. 

Further characteristics of the implementation include the 
following: 

operator training and customization: This is the ability to 
input and store a voice sample of all command words 
recognized in the application for each opera tor to tune 
the system to better accuracy and robustness. At least 
the following words need be uttered once by an opera- 
tor previously unknown to the system : 26 letters of the 
alphabet 'a' . . . 'z\ 10 digits '0' . . . '9'; mnemonic 
qualifiers for control words such as 'enter', * return', 
'accept', 'reject', 'delete', 'exit\ 'escape', 'up', 
'down', 'left', 'right', 'insert', 'home', 'end', 'shift', 
'tab' and mnemonic qualifiers for action words such as 
'read', 'write', 'list', 'info'. Control words are used to 
move the cursor through the screens or through menus 
of the identification user interface, through successive 
fields on a screen or between individual characters 
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within a field. Action words are used to let I he appli- 
cation perform an action, such as writing the identifi- 
cation data to the identification carrier by means of the 
Read/Write subassembly. 

Storage of voice samples to synthesize voice prompts. 
These voice prompts consist of standard words "enter", 
"patient", "examination", "sub-examination", . . . and 
are used to reconstruct any prompt as a concatenation 
of any of these words. 

Barge-in capability, that is the ability of the operator to 
speak over the voice prompt, thereby cancelling the 
prompt. This feature is invaluable for experienced 
operators who do not need to listen to the prompt to 
know what to say to the system. Prompting may be 
switched off completely on operator, request. 

Word spotting capability, that is the ability to recognize 
cither a command word or a command sequence within 
fluent speech. 

Real-time response, that is short response time (typically 
less than 1 sec per item) for display of recognized 
letters, words or command words such that the operator 
feels in control of the actions of the system. 

To secure safe continuation, the identification application 
asks the operator to aid in error detection and correction 
whenever the recognizer is ambiguous or not confident 
of its outcome. 

To limit access to the system to authorized persons only 
and to simultaneously identify the operator for retrieval 
of the operator's customized identification settings, 
speaker verification is used. Speaker verification tech- 
nology determines whether a given speech sample, e.g. 
the operators name, was spoken by the speaker's 
claimed identity. An operator wishing to be verified 
makes an identity claim. This accesses a stored voice 
pattern for that identity. The system compares the time 
aligned speech samples of the operator with the stored 



pattern and computes a similarity or distance score. The 
degree of match can be used to control operator specific 
identification data. 

The digitization and hard/soft copy recording is identical 
5 to the procedure described higher. 

1 claim: 

1. An identification station comprising means for entering 
data identifying a medical image, means for associating data 
with the medical image, characterized in that said means for 

10 entering data are means for entering data through voice 
recognition, wherein said medical image is stored in a 
photostimulable phosphor screen conveyed in a cassette, 
having an electronic memory, and means for transferring 
identification data to said electronic memory by radio- 

15 frequency transmission. 

2. An identification station according to claim 1 wherein 
said means for entering data through voice recognition 
comprise a speech recognition subassembly and a micro- 
phone connected to said speech recognition subassembly. 

20 3. An identification station according to claim 2 provided 
with a speech synthesis subassembly and a speaker con- 
nected to said speech synthesis subassembly. 

4. An identification station according to claim 1 that is 
portable. 

25 5. A method of identifying a medical image comprising 
the steps of 

entering identification data into an identification station, 
associating said identification data with said medical 
image, characterized in that said identification data are 
30 entered into said identification station by speech, 
wherein said medical image is stored in a photostimu- 
lable phosphor screen conveyed in a cassette, having an 
electronic memory, and means for transferring identi- 
fication data to said electronic memory by radio- 
35 frequency transmission. 



